A music information system automatically generated via Web content mining techniques
نویسندگان
چکیده
0306-4573/$ see front matter 2010 Elsevier Ltd doi:10.1016/j.ipm.2010.09.002 ⇑ Corresponding author. Tel.: +43 (0) 732 2468 1 E-mail address: [email protected] (M. Sched 1 In the following, we use the term ‘‘artist” to refe This article deals with the problem ofminingmusic-related information from theWeb and representing this information via amusic information system. Novel techniques have been developed as well as existing ones refined in order to automatically gather information about music artists and bands. After searching, retrieval, and indexing of Web pages that are related to a music artist or band, Web content mining and music information retrieval techniques were applied to capture the following categories of information: similarities between music artists or bands, prototypicality of an artist or a band for a genre, descriptive properties of an artist or a band, band members and instrumentation, images of album cover artwork. Approaches to extracting these pieces of information are presented and evaluation experiments are described that investigate the proposed approaches’ performance. From the insights gained by the various experiments an Automatically Generated Music Information System (AGMIS) providing Web-based access to the extracted information has been developed. AGMIS demonstrates the feasibility of automated music information systems on a large collection of more than 600,000 music artists. 2010 Elsevier Ltd. All rights reserved.
منابع مشابه
Towards an Automatically Generated Music Information System Via Web Content Mining
This paper presents first steps towards building a music information system like last.fm, but with the major difference that the data is automatically retrieved from the WWW using web content mining techniques. We first review approaches to some major problems of music information retrieval (MIR), which are required to achieve the ultimate aim, and we illustrate how these approaches can be put ...
متن کاملAccessing Music Collections Via Representative Cluster Prototypes in a Hierarchical Organization Scheme
This paper addresses the issue of automatically organizing a possibly large music collection for intuitive access. We present an approach to cluster tracks in a hierarchical manner and to automatically find representative pieces of music for each cluster on each hierarchy level. To this end, audio signal-based features are complemented with features derived via Web content mining in a novel way...
متن کاملAutomatically Detecting Members and Instrumentation of Music Bands Via Web Content Mining
In this paper, we present an approach to automatically detecting music band members and instrumentation using web content mining techniques. To this end, we combine a named entity detection method with a rule-based linguistic text analysis approach extended by a rule filtering step. We report on the results of different evaluation experiments carried out on two test collections of bands coverin...
متن کاملDiscovery of Interesting Usage Patterns from Web Data
Web Usage Mining is the application of data mining techniques to large Web data repositories in order to extract usage patterns. As with many data mining application domains, the identiication of patterns that are considered interesting is a problem that must be solved in addition to simply generating them. A necessary step in identifying interesting results is quantifying what is considered un...
متن کاملFrom Sound to ‘Sense’ via Feature Extraction and Machine Learning: Deriving High-Level Descriptors for Characterising Music
Research in intelligent music processing is experiencing an enormous boost these days due to the emergence of the new application and research field of Music Information Retrieval (MIR). The rapid growth of digital music collections and the concomitant shift of the music market towards digital music distribution urgently call for intelligent computational support in the automated handling of la...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Inf. Process. Manage.
دوره 47 شماره
صفحات -
تاریخ انتشار 2011